BEST AVAIIABLE COPY 

RAW SEQUENCE LISTING 
ERROR REPORT 



BIOlx^^HNOLOGY 
SYSTEMS 
BRANCH 




The Biotechnology Systems Branch of the Scientiflc and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: Q 



Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4211 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 c-mail help: patin21help@usDto.gov or phone 703-306-41 19 (R. Wax) 
PATENTIN 3.0 c-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3>0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3,0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://wwAV.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



N: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA- HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3. as this will prevent 'trapping**. 



Wrapped Aminos The amino acid number/text at the end of each line "wrapped " down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent **wrapping". 



Incorrect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



fi/iisaligned Amino Acid 
Numbering 



The numbering under each 5th amino acid is misaligned. This may be caused by the use of tabs 
between the numbering. It is recommended to delete any tabs and use spacing between the numbers. 



Non-ASCII 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can l>e processed. 



Variable Length Sequence(s) contain n's or Xaa's which represented more than one residue. 

As per the mles, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 



Patentin ver. 2.0 "bug" 



A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X; 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence Is intentionally skipped 



Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence{s). 



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence Id number 
<400> sequence id number 
000 



Use of n's or Xaa's 
(NEW RULES) . 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa. and which residue n or Xaa represents. 



Use of "Artificial" Use of "Artificial" only as "<21 3> Organism" response is incomplete, per 1 .823(b). of New Sequence Rules. 

(NEW RULES) Valid response is Artificial Sequence. . 



Use of <220>Feature 
(NEW RULES) 



Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>0RGAN1SM is "Artificial Sequence" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 



Patentin ver. 2,0 "bug" 



Please do not use **Copy to Disk" function of Patentin version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager" or any other means to copy file to floppy disk. 



AMC - Biotechnoloav Systems Branch - 4/06/2001 
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OIPE 



RAW SEQUENCE LISTING DATE: 05/16/2001 

PATENT APPLICATION: US/09/847,081 TIME: 15:06:10 

Input Set : A:\DES,txt Does Not Comply 

Output Set: N:\cRF3\o5i62ooi\i84708i , raw CorrGctecl DiskettB Needed 

3 <110> APPLICANT: BAYER AG 

5 <120> TITLE OF INVENTION: DNA encoding the tobacco phytoene synthase 

7 <130> FILE REFERENCE: Le A 34 326 

r^U-/ 9 <140> CURRENT APPLICATION NUMBER: US/09/847,081 

isjpV 10 <141> CURRENT FILING DATE: 2001-05-02 

12 <160> NUMBER OF SEQ ID NOS : 10 

14 <170> SOFTWARE: Patentln Ver . 2.1 



ERRORED SEQUENCES 

16 <210> SEQ ID NO: 1 

17 <211> LENGTH: 1728 

18 <212> TYPE: DNA 

19 <213> ORGANISM: Nicotiana tabacum 
21 <220> FEATURE: 



22 <221> NAME/KEY 

23 <222> LOCATION 
25 <400> SEQUENCE 



CDS 

(244) . . (1566) 
1 

26 agaaacccag aaagaacaac aggttttgct tcttgttgat gagtgcattt gcctctgcttT 60 
E--> 28 gtgtaaggca aagtcggttc actttcttat atccgatttt tataatcgtt gaaattagtg 




29 120 

E--> 31 gatagactct agtggatatc tacaagtatt ggttttttga taaaataggc tgaggtgaga 

32 180 

E--> 34 aggtaacata aaggaaagac aaaaacttgg gaattgtttt agaccaccga ggtttcttgt 

35 240 

E--> 3 7 ttc atg age atg tct gtt get ttg ttg tgg gtt gtt tot ccc act tec 

38 288 

39 Met Ser Met Ser Val Ala Leu Leu Trp Val Val Ser Pro Thr Ser 

40 1 5 10 15 
E--> 42 gag gtc teg aat ggg aca gga ttg ttg gat tea gtc cga gaa gga aac 

43 336 

4 5 Glu Val Ser Asn Gly Thr Gly Leu Leu Asp Ser Val Arg Glu Gly Asn 
46 20 25 30 

E--> 4 8 cge gtc ttt gta tea tec agg ttc eta get cga gat agg aat ttg atg 

49 384 

51 Arg Val Phe Val Ser Ser Arg Phe Leu Ala Arg Asp Arg Asn Leu Met 

52 35 40 45 

E--> 54 tgg aat ggg aga ate aag aaa ggt iggg aga caa agg tgg aat ttt gge 
55 432 

57 Trp Asn Gly Arg He Lys Lys Gly Gly Arg Gin Arg Trp Asn Phe Gly 

58 50 55 60 

E--> 60 tct tta att get gat cea aga tat tea tge ttg ggt gga tea aga act 

61 480 

64 Ser Leu He Ala Asp Pro Arg Tyr Ser Cys Leu Gly Gly Ser Arg Thr 

65 65 70 75 

E--> 67 gaa aag gga age act ttc tct gta eag tee agt ttg gtg get age cea 
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RAW SEQUENCE LISTING DATE: 05/16/2001 

PATENT APPLICATION: US/09/847,081 TIME: 15:06:10 



Input Set : A:\DES.txt 

Output Set: N:\CRF3\05162001\I847081.raw 





68 


528 


































70 


Glu 


Lys 


Glv 


Ser 


Thr 


Phe 


Ser 


Val 


Gin 


Ser 


Ser 


Leu 


Val 


A 1 ;i 
nXd 


OCX 


Pro 




71 


80 










85 










90 












E--> 


73 


act 

k. 


crcra 
yy 


gaa 


atg 


act gtg 


tea 


tea 


y dy 


aaa 


aag 


gtg 


tat 
L>d 1* 


y d u 


gtg 


rrt a 
y Ud 




74 


576 


































76 


Ala 


Glv 


Glu 


Met 


Thr 


Val 


Ser 


Ser 


Glu 


Lys 


Lys 


Val 


xyx 


A CF» 


V d X 


V dx 




77 










100 










105 










xxu 




E--> 


79 


tta 


aag 


eag 


gca 


gt,i- 


tta 


gtg 


aag 


agg 


cag 


ctg 


aga 


tet 


aee 


gat 


gat 




80 


624 


































82 


Leu 


Lys 


Gin 


Ala 


Ala 


Leu 


Val 


Lys 


Arg 


Gin 


Leu 


Arg 




X JiX 


Asp 


Asp 




83 








115 










120 










X Z J 






E--> 


85 


tta 


gaa 


OtQ 

y ^y 


aag 


ecg 


y ct u 


att 


att 


ott 


cca 




aat 


ttg 


ggc 


ttrr 

ttg 


ttrr 

ttg 




86 


672 


































88 


Leu 


Glu 


Val 


Lys 


r J. vJ 


Asp 


He 


Val 


Val 


Pro 




A c n 


Leu 


vjxy 


Leu 


Leu 




89 






130 










135 










X *i U 








E--> 


91 


a^t 


gaa 


gca 


tat 


ga L. 


cgt 


tat 


ggc 


gaa 


rrt;t 
y ud 


trrt 

t-gt 


gca 


gag 


t at 
Ud L. 


gca 


aag 




92 


720 


































94 


Ser 


Glu 


Ala 


Tvr 




Arg 


Cy s 


Gly 


Glu 


val. 




r\Xa 


V5X U 


Tyr 


Ala 


Lys 




95 




145 










150 










155 










E- - > 


97 

98 


aca 

7 fi fl 

/DO 


ttt 


tae 


tta 


gga 


aee 


aag 


r»t ^ 


d uy 


ace 


cca 


gag 


aga 


aga 


aga 


get 




100 


Thr 


Phe 


Tvr 


Leu 




T" K 

i nr 


Lys 


Leu 


Met 


Thr 


r i \J 


ux u. 


Arg 


Arg 


Arg 


A 1 a 




101 


160 




















1 10 










1 7 
X / D 


E- - > 


103 


ate 


+" fin 


gca 


ata 




gtg 


tgg 


tgc 


agg 


aga 


acg 


gat 


gag 


c t U 


gtt 


/-ra -f- 

gat 




1 OA 
J. u *» 


O J. o 


































106 


lie 


ixp 


Ala 






V d J. 


Trp 


Cy s 


Arg 


Arg 


Thr 


Asp 


Glu 


Leu 


va J. 


Asp 




107 










1 fin 










185 










1 y u 




E--> 


109 


adc 
yy ^ 


cct 


aat 


gea 


tec 


eae 


ata 


a nt 


ecg 


caa 


get 


tta 


gat 


agg 


tgg 


gag 




110 


864 


































112 


Gly 


Pro 


Asn 


Ala 


Ser 


His 


He 


Thr 


Pro 


Gin 


Ala 


Leu 


Asp 


Arg 


Trp 


Pin 
\j± U 




113 








195 










200 










Z U _) 






E- - > 


115 


acc 


agg 


etg 


gaa 


ga U 




tt r* 


dy u 


ggg 


egg 


eea 


ttt 


gat 


atg 


^-1- -i- 
ctx. 


gat 




116 


912 


































118 


Thr 


Arg 


Leu 


Glu 


Asp 


He 


Phe 


Ser 


Gly Arg 


Pro 


Phe 


Asp 


1*1 tr L 


Leu 


Asp 




119 






210 










215 










220 








E- - > 


121 


get 


get 


tta 


tec 


gat 


act 


gtc 


tee 


aga 


ttt 


ect gtt gat 


att 


cag 


eea 




1 

±z ^ 


960 


































124 


Ala 


Ala 


Leu 


Ser 


Asp 


Thr 


Val 


Ser 


Arg 


Phe 


Pro 


Val 


Asp 


He 


Gin 


Pro 




125 




225 










230 










235 










E--> 


127 


ttc 


aga 


gat 


atg 


att 


gaa 


gga 


atg 


cgt 


atg 


gae 


ttg 


tgg 


aaa 


tee 


aga 




128 


1008 
































130 


Phe 


Arg 


Asp 


Met 


He 


Glu 


Gly 


Met 


Arg 


Met 


Asp 


Leu 


Trp 


Lys 


Ser 


Arg 




131 


240 










245 










250 










255 


E--> 


133 


tac 


aaa 


act 


ttc 


gat 


gag 


eta 


tat 


etc 


tat 


tgt 


tac 


tat 


gtt 


get 


ggt 




134 


1056 






























136 


Tyr. Lys 


Thr 


Phe 


Asp 


Glu 


Leu 


Tyr 


Leu 


Tyr 


Cys 


Tyr 


Tyr 


Val 


Ala 


Gly 




137 










260 










265 










270 




E--> 


139 


act 


gta 


gga 


ttg 


atg 


agt 


gtt 


cca 


gtt 


atg 


ggt 


att 


gea 


cct 


gaa 


tea 




14 0 


1104 
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RAW SEQUENCE LISTING DATE: 05/16/2001 

PATENT APPLICATION: US/09/847,081 TIME: 15:06:10 



Input Set : A:\DES.txt 

Output Set: N:\CRF3\05162001\I847081, raw 







142 


Thr Val 


Glv 


Leu 


Met 


Ser 


Val 


Pro 


Val 


Met Gly 


T 1 

X xts 


Ala 


Pro 

IT X u 


c.^ 11 

u X U 








14 3 






275 










280 










z 0 -J 








-> 


145 

J. *t o 


aag gca 

1152 


aca 


aca 


gag 


agt 


gta 


tat 


aat 


get 


get 


ttg 


get 


tta 


ggg 


Ctt 






148 


Lvs Ala 


Thr 


Thr 


Glu 


Ser 


Val 


Tvr 
±y X 


Asn 


Ala 


Ala 


Leu 


Ala 


Leu 


Gly 








14 9 




290 










295 










J u u 








E- 


-> 


151 

1 '^O 
XD Z 


gca aat 

X z u u 


caa 


eta 


ace 


aat 


ata 


etc 


aga 


gau 


gta 


gga 


gaa 


gat 


gee 


aga 






154 


Ala A Qn 
r\Xcl noli 


Gin 


Leu 


Thr 


A n 
sn 


X Xc 


Leu 


Arg 


Asp 


Val 


oxy 


n n 

0 X U 


Asp 


Ala 
rVX a 


Arg 






155 


305 










310 










J X J 










E- 


-> 


157 

J. J o 


aga gga 

1248 


aga 


gta 


tac 


ttg 


cct 


caa 


gat 


gaa 


tea 


gca 


cag 


gca 


ggg 


etc 






160 


Arg Gly 


rg 


Val 


lyx 


ijeu 


It X (J 


VjX 11 


Asp 


Glu 


Leu 


nXd 


Pin 


AXd 


oxy 


Leu 






161 


320 








325 










330 










R 

J J D 


E- 


-> 


163 


tec gac 

X Z 3 D 


gaa 


gac 


ata 


ttt 


get 


gga 


aga 


gtg 


acu 


gat 


aag 


tgg 


agg 


aac 






X D O 




V3X u 


Asp 


Tip 
X xc 


IT lie 


AT a 
rtX d 


\jxy 


Arg 


V Ct X 


± ilX 


Asp 


Lys 


Trp 


Arg 


Asn 






1 fi7 








T 4 0 

J *± VJ 










345 














E- 


-> 


169 

1 7 fl 


ttt atg 

X J *t *t 


aag 


aaa 


caa 


att 


cag 


agg 


gcg 


agg 


aaa 


ttc 


ttt 


gat 


gag 


tea 






172 


C 11.^ l ie? L. 


Lys 


Lys 


Gin 


lie 


Gin 


Arg 


Ala 


Arg 


Lys 


IT He 


IT life; 


Asp 


0 X u 


Ser 






1 7 

X / J 






J J J 










fin 










T fi ^ 
J D 0 






E- 


-> 


175 

1 7 fi 
J. / D 


gag aaa 

X J y 


ggt 


gtc 


aca 


gaa 


ctg 


gac 


tct 


get 


agt 


aga 


tgg 


cct 


gtg 


tta 






1 7 fl 


Glu Lys 


Kjxy 


VaX 


i nr 


VjXU 


Leu 


Asp 


Ser 


nXa 


OCX 


Arg 


Trp 


Pro 


vax 


Leu 






1 7 Q 

X / 3 




T 7n 










7 *^ 










"5 fin 
J 0 u 








E- 


_ > 


181 

X o z 


a /-» a ft/TT 

dVa'Ct y y 

1 A A n 

X ft 4 U 


ctg 


c g 




ua I- 


cgc 


aag 


d 


4. 4- _ 

ttg 


gac 


gag 


att 


gaa 


gee 


aac 






X O H 


1 ill. nXa 


Leu 


Leu 


Leu 


Tyr 


Arg 


Lys 


X xe 


Leu 


Asp 


Glu 


He 


Glu 


Ala 


Asn 






185 


ft S 
J O J 










T on 

J J U 










395 










E- 




1 fl7 

X o o 


gac tac 

1488 


aac 


aac 


ttc 


aca 


agg 


agg 


get 


tat 


gtt 


age 


aag 


cca 


aag 


aag 






190 


Asp Tyr 


Asn 


Asn 


Phe 


Thr 


Arg 


Arg 


Ala 


Tyr 


Val 


Ser 


Lys 


Pro 


Lys 


Lys 






1 Q1 

X 7 X 


400 








405 










410 










415 




- > 


1 Q*^ 

X ? J 

1 QA 
X 3 ft 


ctt etc 

1536 


ace 


ttg 


ecc 


att 


get 


tat 


gca 


aaa 


tct 


ctt 


gtg 


ccc 


cct 


aat 






196 


Leu Leu 


Thr 


Leu 


Pro 


He 


Ala 


Tyr 


Ala 


Lys 


Ser 


Leu 


Val 


Pro 


Pro 


Asn 






197 








420 










425 










430 




E- 


-> 


199 


aga act 


tec 


tct 


cca 


eta 


gca 


aag 


aca 


tga 


atgaagtagt tgagtcaatg 






200 


1586 


































202 


Arg Thr 


Ser 


Ser 


Pro 


Leu 


Ala 


Lys 


Thr 




















203 






435 










440 
















E- 


-> 


205 


agtattatac actaaagaaa ctcaggtact 


. tgtaaatgag 


atatettttg etaaatgtgt 






206 


1646 






























E- 


-> 


208 


ateatcaaaa gtagattgta aattcaatat 


gacaatctct 


tggtagaata ttttctccac 






209 


1706 






























E-. 


-> 


211 


actcatcaaa ccctcaagtg ag 
























212 


1728 


































307 


<210> SEQ ID 


NO: 


3 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/847,081 



DATE: 05/16/2001 
TIME: 15:06:10 



Input Set : A; \DES.txt 

Output Set: N:\CRF3\05162001\I847081.raw 



308 <211> LENGTH: 1712 

309 <212> TYPE: DNA 

310 <213> ORGANISM: Nicotiana tabacum 
312 <220> FEATURE: 



313 <221> NAME/KEY 

314 <222> LOCATION 
316 <400> SEQUENCE 



CDS 

(333). .(1565) 
3 

W--> 317 cttgaagagt agcagcagca agcaagahaa ttaaagtggg ctatttbkka naagccattg 60 
E--> 319 ttacmagara attaagaagc caagamacag gttattttct acttgagtya ggaaaagttg 

320 120 

E--> 322 gtttgcttta tttgtgggct ttttataatc ttttttccac aagggaaagt gggtattttc 

323 180 

E--> 325 ttgaaagtgg atttagactc tagtgggaat ctactaggag taaatttatt aattttttat 

326 240 

E--> 3 28 aaattaagca gaggaaggaa ggaaacagaa aacagaaagt aagacaaaaa accttggaat 

329 300 

E--> 331 tgttttagaa agccaaggtt ttcctgttca aa atg tct gtt gcc ttg tta tgg 

332 353 

33 3 Met Ser Val Ala Leu Leu Trp 

334 1 5 

E--> 3 36 gtt gtt tea cct tgt gaa gtc tea aat ggg aca gga ttc ttg gat tea 

337 401 

339 Val Val Ser Pro Cys Glu Val Ser Asn Gly Thr Gly Phe Leu Asp Ser 

340 10 15 20 

E--> 342 gtc egg gag gga aae egg gtt ttt gat teg teg agg eat agg aat tta 

343 449 

345 Val Arg Glu Gly Asn Arg Val Phe Asp Ser Ser Arg His Arg Asn Leu 

346 25 30 35 

E--> 34 8 gtg tge aat gag aga aae aag aga ggt gtg aaa caa agg tgg aat ttt 

349 497 

351 Val Cys Asn Glu Arg Asn Lys Arg Gly Val Lys Gin Arg Trp Asn Phe 

352 40 45 50 55' 
E--> 3 54 ggt tet gta agg tet get atg gtg get aca ceg gcg gga gaa atg gcg 

355 545 

357 Gly Ser Val Arg Ser Ala Met Val Ala Thr Pro Ala Gly Glu Met Ala 

358 . 60 65 70 
E--> 3 60 aeg atg aca tea gaa eag atg gtt tat gat gtg gtt tta aaa caa gea 

361 593 . 

363 Thr Met Thr Ser Glu Gin Met Val Tyr Asp Val Val Leu Lys Gin Ala 

364 75 80 85 

E--> 366 get tta gtg aag agg eag ttg aga tet get gat gat tta gaa gtg aag 
367 641 

369 Ala Leu Val Lys Arg Gin Leu Arg Ser Ala Asp Asp Leu Glu Val Lys 

370 90 95 100 

E--> 372 ceg gag ate cct etc ece ggg aat ttg age ttg ttg agt gaa gca tat 

373 689 

375 Pro Glu lie Pro Leu Pro Gly Asn Leu Ser Leu Leu Ser Glu Ala Tyr 

376 105 110 115 

E--> 378 gat agg tgt agt gaa gta tgt gca gag tat gea aag aca ttt tae tth 
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T 7 Q 


7 7 
































w- 




J O 1 

■5 P O 
J O Z 


Asp 

1 on 


Arg 


Cys 


oer 




va± 

1 7 ^ 


Cys 


AX a 




Tyr. 


a1 a 

± J U 


Lys 


Txir. 


file 


Tyr 


Ya a 
Add 

X J 3 


E- 


-> 


384 

OOP 

J o D 


gga 

7 Q r 


acc 


atg 


yta 


atg 


act 


cca 


gag 


aga 


aga 


agg 
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gat 
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Met 
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Z DU 
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aat 


caa 
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433 


1169 


































435 


Glu 


Ser 


Val 


Tyr 


Asn 


Ala 


Ala 


Leu 


Ala 


Leu 


Gly 


He 


Ala 


Asn 


Gin 


Leu 






436 




265 










270 










275 










E- 


-> 


438 


acg 


aac 


ata 


etc 


aga 


gat 


gtt 


gga 


gaa 


gat 


gee 


aga 


aga 


gga 


aga 


gtc 






439 


1217 


































441 


Thr 


Asn 


He 


Leu 


Arg 


Asp 


Val 


Gly 


Glu 


Asp 


Ala 


Arg 


Arg 


Gly 


Arg 


val 






442 


280 










285 
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E- 


-> 
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tta 


cct 
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gat 


gaa 


tta 


gca 


cag 


gca 


ggt 
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1265 
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Tyr 


Leu 
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Gin 


Asp 


Glu 


Leu 


Ala 


Gin 


Ala 


Gly 


Leu 


Phe 


Asp 


Asp 


Asp 






448 










300 
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310 




E- 


-> 


450 

451 


ata 

1313 


ttt 


get 


gga 


aaa 


gtg 


act 


gat 


aag 


tgg 


aga 


age 


ttt 


atg 


aag 


aag 
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E--> 


456 

A ^1 
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<210> SEQ ID NO; 


: 4 


























c n A 


<211> LENGTH: 410 




























<212> TYPE: 


PRT 




























c n 


<213> ORGANISM: 


Nicotiana tabacum 


f- 
















dU4 


<400> SEQUENCE: 


4 


























c n c 


Met Ser 


Val 


Ala 


Leu 


Leu 


Trp 


Val 


Val 


Ser 


Pro 


Cys 


Glu 


Val 


Ser 


Asn 




506 


1 






5 










10 










15 






508 


Gly Thr 


Gly 


Phe 


Leu 


Asp 


Ser 


Val 


Arg 


Glu 


Gly 


Asn 


Arg 


Val 


Phe 


Asp 
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100 
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526 
527 
E--> 530 

531 
533 
534 
536 
537 
539 
540 
542 
543 
545 
546 
548 
549 
551 
552 
554 
555 
557 
558 
560 
561 
563 
564 
567 
568 
570 
571 
573 
574 
576 
577 
579 
580 
582 
583 
587 
588 
589 
590 
592 
593 
594 
596 
597 
E--> 599 
600 
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Ser Leu Leu Ser Glu Ala Tyr Asp Arg Cys Ser Glu Val Cys Ala Glu 

115 ^-^lao ^ ' — 



Tyr Ala Lys Thr Phe Tyr /xaa ycly Thr Met/Xaa^Met Thr Pro Glu Arg 
130 V-X 140 • 

Arg Arg Ala lie Trp Ala lie Tyr Val Trp Cys Arg Arg Thr Asp Glu 
145 150 155 160 

Leu Val Asp Gly Pro Asn Ala Ser His lie Thr Pro Gin Ala Leu Asp 

165 170 175 

Arg Trp Glu Asp Arg Leu Glu Asp Val Phe Ser Gly Arg Pro Phe Asp 

180 185 190 

Met Leu Asp Ala Ala Leu Ser Asp Thr Val Ser Lys Phe Pro Val Asp 

195 200 205 

lie Gin Pro Phe Arg Asp Met lie Glu Gly Met Arg Met Asp Leu Arg 

210 215 220 

Lys Ser Arg Tyr Arg Asn Phe Asp Glu Leu Tyr Leu Tyr Cys Tyr Tyr 
225 230 235 • 240 

Val Ala Gly Thr Val Gly Leu Met Ser Val Pro He Met Gly He Ala 

245 250 255 

Pro Asp Ser Lys Ala Thr Thr Glu Ser Val Tyr Asn Ala Ala Leu Ala 

260 265 270 

Leu Gly He Ala Asn Gin Leu Thr Asn He Leu Arg Asp Val Gly Glu 

275 280 285 

Asp Ala Arg Arg Gly Arg Val Tyr Leu Pro Gin Asp. Glu Leu Ala Gin 

290 295 300 

Ala Gly Leu Phe Asp Asp Asp He Phe Ala Gly Lys Val Thr Asp Lys 
305 310 315 320 

Trp Arg Ser Phe Met Lys Lys Gin He Gin Arg Ala Arg Lys Phe Phe 

325 330 335 

Asp Glu Ala Glu Glu Gly Val Thr Gin Leu Ser Ser Ala Ser Arg Trp 

340 345 350 

Pro Val Trp Ala Ser Leu Leu Leu Tyr Arg Gin He Leu Asp Glu He 

355 360 365 

Glu Ala Asn Asp Tyr Asn Asn Phe Thr Lys Arg Ala Tyr Val Ser Lys 

370 375 380 

Pro Lys Lys Leu He Ser Leu Pro He Ala Tyr Ala Lys Ser Leu Val 
385 390 395 400 

Pro Pro Thr Arg Thr Leu Val Thr Ser Ser 
405 410 
<210> SEQ ID NO: 5 
<211> LENGTH: 2205 
<212> TYPE: DNA 

<213> ORGANISM: Nicotiana tabacum 
<220> FEATURE: 



<221> NAME/KEY 
<222> LOCATION 
<400> SEQUENCE 



CDS 

(189) . . (1955) 
5 

ctggcatctt acatctgcca aatttctcat ttatagcatc tcctaatctt tagatacctt 60 
ttcttcttgt tttgtttttc tatccttcac ttcatgcttt cttgttttac ccatctcttc 

120 
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C T 7 
D J / 




0 u 










85 










y u 
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-> 
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get 
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gtg 
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etc 
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ca L. 
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gtg 
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ata 


Ud U 
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C /! 0 
0 4 z 


Axa 


vax 


OX (J. 


Leu 


Leu 


Asp 
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Gly 


His 


Glu 


Val 
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He 


T VT 


Glu 


Ser 






C /I T 
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105 










110 
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-> 
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C /I c 

b4 b 


agg 
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cct 
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ggt 


999 


aaa 


gtg 
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ttt 
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gat 
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egt 
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C y1 Q 
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Arg 
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Phe 


He 


Gly 


Gly 
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Gly 


Ser 


Phe 
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Arg 


Arg 


Gly 
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C O 

boz 
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614 


cac 


att 
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atg 
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ctg 


cat 
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tte 


ttt 
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tgc 
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aat 


aat 






654 


Asn 


His 


He 


Glu 


Met 


Gly 


Leu 


His 


Val 


Phe 


Phe 


Gly 


Cys 


Tyr 


Asn 


Asn 






655 








130 










135 










140 






E- 


-> 


657 

658 


ttg 

662 


tte 


cgt 


ttg 


tta 


aaa 


aag 


gtg 


ggt 


get 


gaa 


aaa 


aat 


ctg 


eta 


gtg 






660 


Leu 


Phe 


Arg 


Leu 


Leu 


Lys 
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Val 


Gly 


Ala 


Glu 
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Asn 


Leu 


Leu 


Val 






661 
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cgc 


ttt 


eca 


gtt 


gga 
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aat 
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Phe 


Arg 
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Val 


Gly 


Ala 


Pro 


Leu 


His 


Gly 


He 


Asn 


Ala 
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67 3 


175 
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185 
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E- 
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675 

67 6 


ttt 
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tta 


tet 


aee 


aat 


eag 


eta 


aag 


att 


tat 


gat 


aag 


get 


aga 


aat 


get 






678 


Phe 


Leu 


Ser 


Thr 


Asn 


Gin 


Leu 


Lys 


He 


Tyr 


Asp 


Lys 


Ala 


Arg 


Asn 


Ala 






679 










195 










200 
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z v ^ 




E- 


-> 
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gta 


get 


ctt 


gee 


ett 


agt 
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gtg 


gtg 


egg 


get 


tta 


gtt gat 


eea 


gat 






u o ^ 


O J *± 




































684 


Val 


Ala 


Leu 


Ala 


Leu 
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Pro 


Val 
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Arg 


Ala 


Leu 


V dx 




Pro 


Asp 






685 








210 










215 










9 9 0 
^ Z U 






E- 


_ > 


687 




etc CI 


tta 


eag 


eag 


ata 


egt gat 


eta 


gat 


ay u 


y Ud 


age 






gag 






u o o 


902 




































690 


Gly 


Ala 


Leu 


Gin 


Gin 




Arg 


Asp 


Leu 


Asp 




V a X 


OCX 


r lie 


Ser 


CK 11 
VjX U 






691 






225 










T "3 n 










9 "3 R 
Z J D 








E- 


- > 


693 


tgg 


ttt 


atg 


tet 


aaa 


gy u 


ggg 


acg 


egt get 


age 


ate 


eag 


agg 


atg 


tgg 






694 


950 




































696 


Trp 


Phe 


Met 


Ser 




vjxy 


Gly 


Thr 


Arg 


Ala 


C O 


X Xc 




Arg 




irp 






697 




240 
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E- 


_ > 
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gat 
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cct 




gca 


tat 


get 


ett: 


gga 


uue 


3 4- -H 


gac 


tot 


gac 


aau 


ate 


agt 
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Asp 
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E--> 74 7 tac aaa ttg gtt gga gtg cct gtt gtt acg gta caa eta cga tac aat 

748 1382 

750 Tyr Lys Leu Val Gly Val Pro Val Val Thr Val Gin Leu Arg Tyr Asn 

751 385 390 395 
E--> 753 ggc tgg gtt aca gag ttg cag gac ttg gag cgt teg agg eaa ttg aag 

754 1430 

756 Gly Trp Val Thr Glu Leu Gin Asp Leu Glu Arg Ser Arg Gin Leu Lys 

757 400 405 410 
E--> 759 egc get aca ggt ttg gac aat etc ctg tat aea cea gat gca gat ttc 

760 1478 

762 Arg Ala Thr Gly Leu Asp Asn Leu Leu Tyr Thr Pro Asp Ala Asp Phe 

763 415 420 425 430 
E--> 765 tet tge ttt geg gac ctt gca ttg gca tct cet gaa gat tat tac att 

766 1526 

768 Ser Cys Phe Ala Asp Leu Ala Leu Ala Ser Pro Glu Asp Tyr Tyr lie 

769 435 440 445 
E--> 771 gag ggc caa ggc tea ttg ett caa tgt gte ctt aca cct ggt gae ect 

772 1574 

774 Glu Gly Gin Gly Ser Leu Leu Gin Cys Val Leu Thr Pro Gly Asp Pro 

775 450 455 460 
E--> 777 tac atg cct eta eta aat gat gaa ate ata aaa aga gtg tea aag cag 

778 1622 

780 Tyr Met Pro Leu Leu Asn Asp Glu He He Lys Arg Val Ser Lys Gin 

781 465 ■ 470 475 
E--> 783 gtt ttg gca eta ttt ect tct tec eaa ggt ctt gag gtt ace tgg tea 

784 1670 

786 Val Leu Ala Leu Phe Pro Ser Ser Gin Gly Leu Glu Val Thr Trp Ser 

787 480 485 490 
E--> 789 tea gtt gtg aaa att ggg caa tee eta tat cgt gaa gga cct ggt aaa 

790 1718 

792 Ser Val Val Lys He Gly Gin Ser Leu Tyr Arg Glu Gly Pro Gly Lys 

793 495 500 505 510 
E--> 795 gae cea tte aga cct gat eag aag act cca gtg gaa aat ttc ttt ett 

796 1766 

798 Asp Pro Phe Arg Pro Asp Gin Lys Thr Pro Val Glu Asn Phe Phe Leu 

799 515 520 525 
E--> 801 get ggc tea tat aea aaa eag gac tac ata gat age atg gaa ggg gca 

802 1814 

804 Ala Gly Ser Tyr Thr Lys Gin. Asp Tyr He Asp Ser Met Glu Gly Ala 

805 530 535 540 
E--> 807 act ctt tea ggt agg eaa gca tct gea tac gta tgt gat get ggc gag 

808 1862 

810 Thr Leu Ser Gly Arg Gin Ala Ser Ala Tyr Val Cys Asp Ala Gly Glu 

811 545 . 550 555 
iG--> 813 aag ctg gtg gtg ttg egg aaa aag att get get get gag tea aac gag 

814 1910 

816 Lys Leu Val Val Leu Arg Lys Lys He Ala Ala Ala Glu Ser Asn Glu 

817 . 560 565 570 
E--> 819 ate tet gaa ggt gta tea gta tet gat gag ttg agt ett gtc tga 
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820 1955 

822 lie Ser Glu Gly Val Ser Val Ser Asp Glu Leu Ser Leu Val 

823 575 580 585 

E--> 825 tgactggaaa tcatccaatg aatactgaag agcacccccc actttgttaa tccgagaagc 

826 2015 

E--> 828 agatacaaac ataactcagt taggcattgc gtaaggaaga gttcttctaa attttgagtt 

829 2075 

E--> 831 cacaagatgg aaatcaaaag gttaaaatat gttgtatgta atattagtaa atcttcatag 

832 2135 

E--> 834 tgatgtatct attctgccac ccttcaggtt tagtgaaatg gatcgtattg ctcatcattc 

835 2195 
E--> 837 attgagaaga 

838 2205 

987 <210> SEQ ID NO: 10 

988 <211> LENGTH: 23 

989 <212> TYPE: DNA 

990 <213> ORGANISM: Nicotiana tabacum 

992 <400> SEQUENCE: 10 

993 tacccgcyqct aaac tacgct tgc 23 
E--> 995 /Le^a 34 326 " 
W--> 100(1 Le A 34 326-Ausland 
E--> lOoi - 17 - 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/847,081 



DATE: 05/16/2001 
TIME: 15:06:12 



Input Set : A:\DES.txt 

Output Set: N:\CRF3\05162001\I847a81.raw 



L:9 M:270 C: Current Application Number differs, Replaced Application Number 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:28 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:120 SEQ : 1 
M:254 Repeated in SeqNo=l 

L:317 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID# : 3 
L:317 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:319 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:120 SEQ : 3 
M:254 Repeated in SeqNo=3 

L:381 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID*: 3 
L:381 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:387 M:258 W: Mandatory Feature missing, <223> not found for SEQ ID# : 3 
L:387 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:530 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ iD# : 4 
L:599 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 120 SEQ: 5 
M:254 Repeated in SeqNo=5 

L:995 M:254 E: No. of Bases conflict, LENGTH: Input : 326 Counted:26 SEQ:10 
L:995 M:320 E: (1) Wrong Nucleic Acid Designator, NUMBER OF INVALID KEYS:2 
L:995 M:112 C: (48) String data converted to lower case, 

L:1000 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 4 
L:1003 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 10 
L:1003 M:252 E: No, of Seq . differs, <211>LENGTH : Input : 23 Found: 26 SEQ: 10 
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